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[Name of the Document] Specification 

[Title of the Invention] A Method for Speech Coding, Method for Speech 

Decoding and their Apparatuses 

[Claims] 

[Claim 1] A speech coding method according to a code-excited 
linear prediction (CELP) speech coding method, comprising: 

evaluating a noise level of a speech in a concerning coding period by 
using a code or coding result of at least one of spectrum information, power 
information, and pitch information; and 

selecting a various excitation codebook based on an evaluation result. 

[Claim 2] The speech coding method of claim 1, further 
comprising: 

the plurality of excitation codebooks storing time series vectors with 
various noise levels; and 

switching the plurality of excitation codebooks based on the evaluation 
result of the noise level of the speech. 

[Claim 3] The speech coding method of claim 1, further 
comprising changing a noise level of time series vectors stored in the excitation 
codebooks based on the evaluation result of the noise level of the speech. 

[Claim 4] The speech coding method of claim 3, further 
comprising: 

an excitation codebook storing noise time series vectors; and 
generating a iow noise time series vector by sampling signal sampler in 

the time series vectors based on the evaluation result of the noise level of the 

speech. 
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[Claim 5] The speech coding method of claim 3, further 
comprising: 

a first excitation codebook storing a noise time series vector and a 
second excitation codebook storing a non-noise time series vector; and 
5 generating a time series vector by adding the time series vector in the 

first excitation codebook and the time series vector in the second excitation 
codebook by weighting based on the evaluation result of the noise level of the 
speech. 

[Claim 6] A speech decoding method according to a code-excited 
10 linear prediction (CELP) speech decoding method, comprising: 

evaluating a noise level of a speech in a concerning decoding period by 
using a code or decoding result of at least one of spectrum information, power 
information, and pitch information; and 

selecting one of a various excitation codebook based on an evaluation 

15 result. 

[Claim 7] The speech decoding method of claim 6, further 
comprising: 

the plurality of excitation codebooks storing time series vectors with 
various noise levels; and 
20 switching the plurality of excitation codebooks based on the evaluation 

result of the noise level of the speech. 

[Claim 8] The speech decoding method of claim 6, further 
comprising changing a noise level of time series vectors scored m uie exciiaaun 
codebooks based on the evaluation result of the noise level of the speech. 
25 [Claim 9] The speech decoding method of claim 8, further 



comprising: 

an excitation codebook storing noise time series vectors; and 
generating a low noise time series vector by sampling signal samples in 

the time series vectors based on the evaluation result of the noise level of the 

speech. 

[Claim 10] The speech decoding method of claim 8, further 
comprising: 

a first excitation codebook storing a noise time series vector and a 
second excitation codebook storing a non-noise time series vector; and 

generating time series vector by adding the code vector in the first 
excitation codebook and the time series vector in the second excitation 
codebook by weighting based on the evaluation result of the noise level of the 
speech. 

[Claim 11] A speech coding apparatus, comprising: 

a spectrum information encoding means for coding spectrum 
information of an input speech, and outputting a coded spectrum information 
as an element of a coding result; 

a noise level evaluating means for evaluating a noise level of a speech 
in a concerning coding period by using a code or coding result of at least one of 
spectrum information and power information, obtained from the coded 
spectrum information provided by the spectrum information encoding means, 
and outputting an evaluation result; 

a tirst excitation codebooic storing a plurality oi non-noi^e time merits 

vectors; 

a second excitation codebook storing a plurality of noise time series 



vectors; 

an excitation codebook switching means for switching the first 
excitation codebook and the second excitation codebook based on the 
evaluation result by the noise level evaluator; 

a weighting-adding means for weighting the time series vectors from 
the first excitation codebook and second excitation codebook depending on 
their gains; 

a synthesis filter for producing a coded speech based on an excitation 
signal, which is a weighted time series vector, and the coded spectrum 
information from the spectrum information encoding means; and 

a distance calculating means for calculating a distance between the 
coded speech and the input speech, searching an excitation code and gain for 
minimizing the distance, and outputting a result as an excitation code and a 
gain code as a coding result. 

[Claim 12] A speech decoding apparatus, comprising: 

a spectrum information decoding means for decoding a spectrum 
information code to spectrum information; 

a noise level evaluating means for evaluating a noise level of a speech 
in a concerning decoding period by using a decoding result or the spectrum 
information code of at least one of spectrum information and power 
information, obtained from decoded spectrum information provided by the 
spectrum information decoding means, and outputting an evaluation result; 

a iiibi excitation coaeoook sLormg <* piuxcoity oi aaoaa-^uao^ i^iie^ 

vectors; 

a second excitation codebook storing a plurality of noise time series 
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vectors; 

an excitation codebook switching means for switching the first 
excitation codebook and the second excitation codebook based on the 
evaluation result of the noise level evaluating means; 
5 a weighting-adding means for weighting the time series vectors from 

the first excitation codebook and second excitation codebook depending on 
their gains; and 

a synthesis filter for producing a decoded speech based on an excitation 
signal, which is a weighted time series vector, and the decoded spectrum 
10 information from the spectrum information decoding means. 
[Detailed Explanation of the Invention] 

[0001] 

[Technical Field of the Invention] 

This invention relates to methods for speech coding and decoding and 

15 apparatuses for speech coding and decoding for performing compression coding 
of a speech signal to a digital signal. Particularly, this invention relates to a 
method for speech coding, method for speech decoding, apparatus for speech 
coding and apparatus for speech decoding for reproducing a high quality 
speech at low bit rates. 

20 [0002] 
[Related Art] 

In the related art, code-excited linear prediction coding (Code-Excited 
Linear Prediction: {Jh,L,t) coding is well-known as an eiucient speecn cuonife 
method, and its technique is described in "Code-excited linear prediction 
25 (CELP): High-quality speech at 8kbps" (ICASSP '85, pp. 937 - 940, by M. R. 



Shroeder and B. S. Atal in 1985) . 
[0003] 

Fig. 6 illustrates an example of a whole configuration of a CELP speech 
coding and decoding method. In Fig. 6, an encoder 1, decoder 2, multiplexing 
means 3, and dividing means 4 are illustrated. 

The encoder 1 includes a linear prediction parameter analyzing means 
5, linear prediction parameter coding means 6, synthesis filter 7, adaptive 
codebook 8, excitation codebook 9, gain coding means 10, distance calculating 
means 11, and weighting-adding means 38. The decoder 2 includes a linear 
prediction parameter decoding means 12, synthesis filter 13, adaptive 
codebook 14, excitation codebook 15, gain decoding means 16, and weighting- 
adding means 39. 

[0004] 

In CELP speech coding, a speech in a frame of about 5-50 ms is 
divided into spectrum information and excitation information, and coded. 

Explanations are made on operations in the CELP speech coding 
method. In the encoder 1, the linear prediction parameter analyzing means 5 
analyzes an input speech Si, and extracts a linear prediction parameter, which 
is spectrum information of the speech. The linear prediction parameter 
coding means 6 codes the linear prediction parameter, and sets a coded linear 
prediction parameter as a coefficient for the syn thesis filter 7. 

[0005] 

Lxpianations are made on cooing oi excitation liuoimaaon. 
An old excitation signal is stored in the adaptive codebook 8. The 
adaptive codebook 8 outputs a time series vector, corresponding to an adaptive 
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code, which is generated by repeating the old excitation signal periodically. 

A plurality of time series vectors trained by reducing a distortion 
between a speech for training and its coded speech for example is stored in the 
excitation codebook 9. The excitation codebook 9 outputs a time series vector 
5 corresponding to an excitation code. 

Each of the time series vectors outputted from the adaptive codebook 8 
and excitation codebook 9 is weighted by using a respective gain provided by 
the gain coding means 10 and added by the weighting- adding means 38. 
Then, an addition result is provided to the synthesis filter 7 as excitation 
10 signals, and a coded speech is produced. The distance calculating means 11 
calculates a distance between the coded speech and the input speech Si, and 
searches an adaptive code, excitation code, and gains for minimizing the 
distance. When the above-stated coding is over, a linear prediction parameter 
code and the adaptive code, excitation code, and gain codes for minimizing a 
15 distortion between the input speech and the coded speech are outputted as a 
coding result. 

[0006] 

In the decoder 2, the linear prediction parameter decoding means 12 
decodes the linear prediction parameter code to the linear prediction 

20 parameter, and sets the linear prediction parameter as a coefficient for the 
synthesis filter 13. The adaptive codebook 14 outputs a time series vector 
corresponding to an adaptive code, which is generated by repeating an old 
excitation signal periodically, 'liie excitation coaeooox io outputs a tune 
series vector corresponding to an excitation code. The time series vectors are 

25 weighted by using respective gains, which are decoded from the gain codes by 
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the gain decoding means 16, and added by the weighting-adding means 39. 
An addition result is provided to the synthesis filter 13 as an excitation signal, 
and an output speech S3 is produced. 
[0007] 

5 Among the CELP speech coding and decoding method, an improved 

speech coding and decoding method for reproducing a high quality speech 
according to the related art is described in "Phonetically - based vector 
excitation coding of speech at 3.6 kbps" (ICASSP '89, pp. 49 - 52, by S. Wang 
and A. Gershoin 1989). 

10 Fig. 7 shows an example of a whole configuration of the speech coding 

and decoding method according to the related art, and same signs are used for 
means corresponding to the means in Fig. 6. 

In Fig. 7, the encoder 1 includes a speech state deciding means 17, 
excitation codebook switching means 18, first excitation codebook 19, and 

15 second excitation codebook 20. The decoder 2 includes an excitation codebook 
switching means 21, first excitation codebook 22, and second excitation 
codebook 23. 

Explanations are made on operations in the coding and decoding 
method in this configuration. In the encoder 1, the speech state deciding 

20 means 17 analyzes the input speech SI, and decides a state of the speech is 
which one of two states, e.g., voiced or unvoiced. The excitation codebook 
switching means 18 switches the excitation codebooks to be used in coding 
based on a speech state deciding result. For example, u ttie speecn is voicea, 
the first excitation codebook 19 is used, and if the speech is unvoiced, the 

25 second excitation codebook 20 is used. Then, the excitation codebook 
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switching means 18 codes which excitation codebook is used in coding. 
[0008] 

In the decoder 2, the excitation codebook switching means 21 switches 
the first excitation codebook 22 and the second excitation codebook 23 based on 
5 a code showing which excitation codebook was used in the encoder 1, so that 

the excitation codebook, which was used in the encoder 1, is used in the 
decoder 2. According to this configuration, excitation codebooks suitable for 
coding in various speech states are provided, and the excitation codebooks are 
switched based on a state of an input speech. Hence, a high quality speech 
10 can be reproduced. 

[0009] 

A speech coding and decoding method of switching a plurality of 
excitation codebooks without increasing a transmission bit number according 
to the related art is disclosed in Japanese Unexamined Published Patent 
15 Application 8 - 185198. The plurality of excitation codebooks is switched 
based on a pitch frequency selected in an adaptive codebook, and an excitation 
codebook suitable for characteristics of an input speech can be used without 
increasing transmission data. 
[0010] 

20 [Problems to be Solved by the Invention] 

As stated, in the speech coding and decoding method illustrated in Fig. 
6 according to the related art, a single excitation codebook is used to produce a 
synthetic speech. Non-noise time series vectors with many pulses snouia be 
stored in the excitation codebook to produce a high quality coded speech even 

25 at low bit rates. Therefore, when a noise speech, e.g., background noise, 
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fricative consonant, etc., is coded and synthesized, there is a problem that a 
coded speech produces an unnatural sound, e.g., "Jiri-Jiri" and "Chiri-Chiri." 
This problem can be solved, if the excitation codebook includes only noise time 
series vectors. However, in that case, a quality of the coded speech degrades 
5 as a whole. 

[0011] 

In the improved speech coding and decoding method illustrated in Fig. 
7 according to the related art, the plurality of excitation codebooks is switched 
based on the state of the input speech for producing a coded speech. 

10 Therefore, it is possible to use an excitation codebook including noise time 
series vectors in a noise period of the input speech and an excitation codebook 
including non-noise time series vectors in a voiced period other than the 
unvoiced noise period, for example. Hence, even if a noise speech is coded and 
synthesized, an unnatural sound, e.g., "Jiri-Jiri," is not produced. However, 

15 since the excitation codebook used in coding is also used in decoding, it 
becomes necessary to code and transmit data which excitation codebook was 
used. It becomes an obstacle for lowing bit rates. 
[0012] 

According to the speech coding and decoding method of switching the 
20 plurality of excitation codebooks without increasing a transmission bit number 
according to the related art, the excitation codebooks are switched based on a 
pitch period selected in the adaptive codebook. However, the pitch period 
selected in the adaptive codebook differs from an actual pitch period oi a 
speech, and it is impossible to decide if a state of an input speech is noise or 
25 non-noise only from a value of the pitch period. Therefore, the problem that 
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the coded speech in the noise period of the speech is unnatural cannot be 
solved. 

[0013] 

This invention was intended to solve the above-stated problems. 
Particularly, this invention aims at providing speech coding and decoding 
methods for reproducing a high quality speech even at low bit rates. 

[0014] 

[Means to Solve the Problems] 

In order to solve the above-stated problems, in a speech coding method 
according to this invention, a noise level of a speech in a concerning coding 
period is evaluated by using a code or coding result of at least one of spectrum 
information, power information, and pitch information, and a various 
excitation codebook is selected based on an evaluation result. 

[0015] 

In a speech coding method according to another invention, a plurality 
of excitation codebooks storing time series vectors with various noise levels is 
provided, and the plurality of excitation codebooks is switched based on an 
evaluation result of a noise level of a speech. 

[0016] 

In a speech coding method according to another invention, a noise level 
of time series vectors stored in an excitation codebook is changed based on an 
evaluation result of a noise level of a speech. 

[0017] 

In a speech coding method according to another invention, an excitation 
codebook storing noise time series vectors is provided. Alow noise time series 
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vector is generated by sampling signal samples in the time series vectors based 
on the evaluation result of a noise level of a speech. 
[0018] 

In a speech coding method according to another invention, a first 
5 excitation codebook storing a noise time series vector and a second excitation 

codebook storing a non-noise time series vector are provided. A code vector is 
generated by adding the times series vector in the first excitation codebook and 
the time series vector in the second excitation codebook by weighting based on 
an evaluation result of a noise level of a speech. 
10 [0019] 

In a speech decoding method according to another invention, a noise 
level of a speech in a concerning decoding period is evaluated by using a code or 
coding result of at least one of spectrum information, power information, and 
pitch information, and a various excitation codebook is selected based on an 
15 evaluation result. 

[0020] 

In a speech decoding method according to another invention, a 
plurality of excitation codebooks storing time series vectors with various noise 
levels is provided, and the plurality of excitation codebooks is switched based 
20 on an evaluation result of the noise level of the speech. 

[0021] 

In a speech decoding method according to another invention, noise 
levels of time series vectors stored in excitation codebooks are changed based 
on an evaluation result of the noise level of the speech. 
25 [0022] 
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In a speech decoding method according to another invention, an 
excitation codebook storing noise time series vectors is provided. A low noise 
time series vector is generated by sampling signal samples in the time series 
vectors based on the evaluation result of the noise level of the speech. 
5 [0023] 

In a speech decoding method according to another invention, a first 
excitation codebook storing a noise time series vector and a second excitation 
codebook storing a non-noise time series vector are provided. A code vector is 
generated by adding the times series vector in the first excitation codebook and 
10 the time series vector in the second excitation codebook by weighting based on 
an evaluation result of a noise level of a speech. 

[0024] 

A speech coding apparatus according to another invention includes a 
spectrum information encoding means for coding spectrum information of an 

15 input speech and outputting a coded spectrum information as an element of a 
coding result, a noise level evaluating means for evaluating a noise level of a 
speech in a concerning coding period by using a code or coding result of at least 
one of the spectrum information and power information, which is obtained 
from the coded spectrum information provided by the spectrum information 

20 encoding means, and outputting an evaluation result, a first excitation 
codebook storing a plurality of non-noise time series vectors, a second 
excitation codebook storing a plurality of noise time series vectors, an 
excitation codebook switching means for switching the tirst excitation codeoooic 
and the second excitation codebook based on the evaluation result by the noise 

25 level evaluating means, a weighting-adding means for weighting the time 
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series vectors from the first excitation codebook and second excitation codebook 
depending on their gains, a synthesis filter for producing a coded speech based 
on an excitation signal, which are weighted time series vectors, and the coded 
spectrum information provided by the spectrum information encoding means, 
5 and a distance calculating means for calculating a distance between the coded 

speech and the input speech, searching an excitation code and gain for 
minimizing the distance, and outputting a result as an excitation code, and a 
gain code as a coding result. 
[0025] 

10 A speech decoding apparatus according to another invention includes a 

spectrum information decoding means for decoding a spectrum information 
code to spectrum information, a noise level evaluating means for evaluating a 
noise level of a speech in a concerning decoding period by using a decoding 
result of at least one of the spectrum information and power information, 

15 which is obtained from decoded spectrum information provided by the 
spectrum information decoding means, and the spectrum information code and 
outputting an evaluating result, a first excitation codebook storing a plurality 
of non-noise time series vectors, a second excitation codebook storing a 
plurality of noise time series vectors, an excitation codebook switching means 

20 for switching the first excitation codebook and the second excitation codebook 
based on the evaluation result by the noise level evaluating means, a 
weighting-adding means for weighting the time series vectors from the first 
excitation codebook and the second excitation coaeoook depending on then 
gains, and a synthesis filter for producing a decoded speech based on an 

25 excitation signal, which is a weighted time series vector, and the decoded 
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spectrum information from the spectrum information decoding means. 

[0026] 
[Embodiments] 

Explanations are made on embodiments of this invention with 
5 reference to drawings. 
[0027] 
Embodiment 1. 

Same signs are used in Fig. 1 for corresponding elements of Fig. 7. 
Fig. 1 illustrates a whole configuration of a speech coding method and speech 

10 decoding method in embodiment 1 according to this invention. In Fig. 1, the 
linear prediction parameter analyzing means 5 is a spectrum information 
analyzing means for analyzing an input speech Si and extracting a linear 
prediction parameter^ which is spectrum information of the speech. The 
linear prediction parameter encoding means 6 is a spectrum information 

15 encoding means for coding the linear prediction parameter, which is the 
spectrum information and setting a coded linear prediction parameter as a 
coefficient for the synthesis filter 7. The first excitation codebooks 19 and 22 
store pluralities of non-noise time series vectors, and the second excitation 
codebooks 20 and 23 store pluralities of noise time series vectors. The noise 

20 level evaluating means 24 and 26 evaluate a noise level, and the excitation 
codebook switching means 25 and 27 switch the excitation codebooks based on 
the noise level. 
[0028] 

Operations are explained. 
25 In the encoder 1, the linear prediction parameter analyzing means 5 



19 

analyzes the input speech Si, and extracts a linear prediction parameter, 
which is spectrum information of the speech. The linear prediction parameter 
encoding means 6 codes the linear prediction parameter. Then, the linear 
prediction parameter encoding means 6 sets a coded linear prediction 
5 parameter as a coefficient for the synthesis filter 7, and also outputs the coded 
linear prediction parameter to the noise level evaluating means 24. 

Explanations are made on coding of excitation information. 

An old excitation signal is stored in the adaptive codebook 8, and a time 
series vector corresponding to an adaptive code, which is generated by 
10 repeating an old excitation signal periodically, is outputted. The noise level 
evaluating means 24 evaluates a noise level in a concerning coding period 
based on the coded linear prediction parameter inputted by the linear 
prediction parameter encoding means 6 and the adaptive code, e.g., a spectrum 
gradient, short-term prediction gain, and pitch fluctuation as shown in Fig. 2, 
15 and outputs an evaluation result to the excitation codebook switching means 
25. The excitation codebook switching means 25 switches excitation 
codebooks for coding based on the evaluation result of the noise level. For 
example, if the noise level is low, the first excitation codebook 19 is used, and if 
the noise level is high, the second excitation codebook 20 is used. 
20 [0029] 

The first excitation codebook 19 stores a plurality of non-noise time 
series vectors, e.g., a plurality of time series vectors trained by reducing a 
distortion Detween a ^peecn lor u'cuimjtg ana iu> toucu opccui. ±no ot^nu 
excitation codebook 20 stores a plurality of noise time series vectors, e.g., a 
25 plurality of time series vectors generated from random noises. Each of the 
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first excitation codebook 19 and the second excitation codebook 20 outputs a 
time series vector respectively corresponding to an excitation code. Each of 
the time series vectors from the adaptive codebook 8 and one of first excitation 
codebook 19 or second excitation codebook 20 are weighted by using a 
respective gain provided by the gain encoder 10, and added by the weighting- 
adding means 38. An addition result is provided to the synthesis filter 7 as 
excitation signals, and a coded speech is produced. The distance calculating 
means 11 calculates a distance between the coded speech and the input speech 
Si, and searches an adaptive code, excitation code, and gain for minimizing the 
distance. When this coding is over, the linear prediction parameter code and 
an adaptive code, excitation code, and gain code for minimizing the distortion 
between the input speech and the coded speech are outputted as a coding 
result S2. These are characteristic operations in the speech coding method in 
embodiment 1. 
[0030] 

Explanations are made on the decoder 2. In the decoder 2, the linear 
prediction parameter decoding means 12 decodes the linear prediction 
parameter code to the linear prediction parameter, and sets the decoded linear 
prediction parameter as a coefficient for the synthesis filter 13, and outputs the 
decoded linear prediction parameter to the noise level evaluating means 26. 

Explanations are made on decoding of excitation information. The 
adaptive codebook 14 outputs a time series vector corresponding to an adaptive 
code, winch is generated by repeating an old. excitation signal pvzioai^<uiy. 
The noise level evaluating means 26 evaluates a noise level by using the 
decoded linear prediction parameter inputted by the linear prediction 
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parameter decoding means 12 and the adaptive code in a same method with 
the noise level evaluating means 24 in the encoder 1, and outputs an 
evaluation result to the excitation codebook switching means 27. The 
excitation codebook switching means 27 switches the first excitation codebook 
5 22 and the second excitation codebook 23 based on the evaluation result of the 
noise level in a same method with the excitation codebook switching means 25 
in the encoder 1. 
[0031] 

A plurality of non-noise time series vectors, e.g., a plurality of time 
10 series vectors generated by training for reducing a distortion between a speech 
for training and its coded speech, is stored in the first excitation codebook 22. 
A plurality of noise time series vectors, e.g., a plurality of vectors generated 
from random noises, is stored in the second excitation codebook 23. Each of 
the first and second excitation codebooks outputs a time series vector 
15 respectively corresponding to an excitation code. The time series vectors from 
the adaptive codebook 14 and one of excitation codebook 22 or excitation 
codebook 23 are weighted by using respective gains, decoded from gain codes 
by the gain decoding means 16, and added by the weighting-adding means 39. 
An addition result is provided to the synthesis filter 13 as an excitation signal, 
20 and an output speech S3 is produced. These are operations are characteristic 
operations in the speech decoding method in embodiment 1. 
[0032] 

in embodiment 1, the noise level ot tne input speecn i^ evaiuaiea o> 
using the code and coding result, and various excitation codebooks are used 
25 based on the evaluation result. Therefore, a high quality speech can be 
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reproduced with a small data amount. 

[0033] 
Embodiment 2. 

In embodiment 1, two excitation codebooks are switched. However, it 
5 is also possible that three or more excitation codebooks are provided and 

switched based on a noise level. 

In embodiment 2, a suitable excitation codebook can be used even for a 
medium speech, e.g., slightly noisy, in addition to two kinds of speech, i.e., 
noise and non-noise. Therefore, a high quality speech can be reproduced. 
10 [0034] 
Embodiment 3. 

Fig. 3 shows a whole configuration of a speech coding method and 
speech decoding method in embodiment 3 of this invention. In Fig. 3, same 
signs are used for units corresponding to the units in Fig. 1. In Fig. 3, 
15 excitation codebooks 28 and 30 store noise time series vectors, and sampling 
means 29 and 31 set an amplitude value of a sample with a low amplitude in 
the time series vectors to zero. 

[0035] 

Operations are explained. In the encoder 1, the linear prediction 
20 parameter analyzing means 5 analyzes the input speech Si, and extracts a 
linear prediction parameter, which is spectrum information of the speech. 
The linear prediction parameter encoding means 6 codes the linear prediction 
parameter. Tnen, tfre linear prediction parameter eiicouiiife mean^ b s^o <x 
coded linear prediction parameter as a coefficient for the synthesis filter 7, and 
25 also outputs the coded linear prediction parameter to the noise level evaluating 
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means 24. 

Explanations are made on coding of excitation information. An old 
excitation signal is stored in the adaptive codebook 8, and a time series vector 
corresponding to an adaptive code, which is generated by repeating an old 
5 excitation signal periodically, is outputted. The noise level evaluating means 
24 evaluates a noise level in a concerning coding period by using the coded 
linear prediction parameter, which is inputted from the linear prediction 
parameter encoding means 6, and an adaptive code, e.g., a spectrum gradient, 
short-term prediction gain, and pitch fluctuation, and outputs an evaluation 
10 result to the sampling means 29. 

[0036] 

The excitation codebook 28 stores a plurality of time series vectors 
generated from random noises, for example, and outputs a time series vector 
corresponding to an excitation code. If the noise level is low in the evaluation 

15 result of the noise, the sampling means 29 outputs a time series vector, in 
which an amplitude of a sample with an amplitude below a determined value 
in the time series vectors, inputted from the excitation codebook 28, is set to 
zero, for example. If the noise level is high, the sampling means 29 outputs 
the time series vector inputted from the excitation codebook 28 without 

20 modification. Each of the times series vectors from the adaptive codebook 8 

and the sampling means 29 is weighted by using a respective gain provided by 
the gain encoder 10 and added by the weighting- adding means 38. An 
addition result is provided to the synthesis later '/ as excitation signals, ana a 
coded speech is produced. The distance calculating means 11 calculates a 

25 distance between the coded speech and the input speech Si, and searches an 
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adaptive code, excitation code, and gain for minimizing the distance. When 
coding is over, the linear prediction parameter code and the adaptive code, 
excitation code, and gain code for minimizing a distortion between the input 
speech and the coded speech are outputted as a coding result S2. These are 
5 characteristic operations in the speech coding method in embodiment 3. 
[0037] 

Explanations are made on the decoder 2. In the decoder 2, the linear 
prediction parameter decoding means 12 decodes the linear prediction 
parameter code to the linear prediction parameter. The linear prediction 
10 parameter decoding means 12 sets the linear prediction parameter as a 
coefficient for the synthesis filter 13, and also outputs the linear prediction 
parameter to the noise level evaluating means 26. 

Explanations are made on decoding of excitation information. The 
adaptive codebook 14 outputs a time series vector corresponding to an adaptive 
15 code, generated by repeating an old excitation signal periodically. The noise 
level evaluating means 26 evaluates a noise level by using the decoded linear 
prediction parameter inputted from the linear prediction parameter decoding 
means 12 and the adaptive code in a same method with the noise level 
evaluating means 24 in the encoder 1, and outputs an evaluation result to the 
20 sampling means 3 1 . 

[0038] 

The excitation codebook 30 outputs a time series vector corresponding 
to an excitation code. 'ine sampling means bl outputs a time series veu^ 
based on the evaluation result of the noise level in same processing with the 
25 sampling means 29 in the encoder 1. Each of the time series vectors 
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outputted from the adaptive codebook 14 and sampling means 31 are weighted 
by using a respective gain provided by the gain decoder 16, and added by the 
weighting-adding means 39. An addition result is provided to the synthesis 
filter 13 as an excitation signal, and an output speech S3 is produced. 
[0039] 

In embodiment 3, the excitation codebook storing noise time series 
vectors is provided, and an excitation with a low noise level can be generated 
by sampling excitation signal samples based on an evaluation result of the 
noise level the speech. Hence, a high quality speech can be reproduced with a 
small data amount. Further, since it is not necessary to provide a plurality of 
excitation codebooks, a memory amount for storing the excitation codebook can 
be reduced. 

[0040] 
Embodiment 4. 

In embodiment 3, the samples in the time series vectors are either 
sampled or not. However, it is also possible to change a threshold value of an 
amplitude for sampling the samples based on the noise level. In embodiment 
4, a suitable time series vector can be generated and used also for a medium 
speech, e.g., slightly noisy, in addition to the two types of speech, i.e., noise and 
non-noise. Therefore, a high quality speech can be reproduced. 

[0041] 
Embodiment 5. 

Fig. 4 shows a whole configuration ot a speech coding method and a 
speech decoding method in embodiment 5 of this invention, and same signs are 
used for units corresponding to the units in Fig. 1. 
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In Fig. 4, first excitation codebooks 32 and 35 store noise time series 
vectors, and second excitation codebooks 33 and 36 store non-noise time series 
vectors. The weight determining means 34 and 37 are also illustrated. 

[0042] 

5 Operations are explained. In the encoder 1, the linear prediction 

parameter analyzing means 5 analyzes the input speech Si, and extracts a 
linear prediction parameter, which is spectrum information of the speech. 
The linear prediction parameter encoding means 6 codes the linear prediction 
parameter. Then, the linear prediction parameter encoding means 6 sets a 
10 coded linear prediction parameter as a coefficient for the synthesis filter 7, and 
also outputs the coded prediction parameter to the noise level evaluating 
means 24. 

Explanations are made on coding of excitation information. The 
adaptive codebook 8 stores an old excitation signal, and outputs a time series 
15 vector corresponding to an adaptive code, which is generated by repeating an 
old excitation signal periodically. The noise level evaluating means 24 
evaluates a noise level in a concerning coding period by using the coded linear 
prediction parameter, which is inputted from the linear prediction parameter 
encoding means 6 and the adaptive code, e.g., a spectrum gradient, short-term 
20 prediction gain, and pitch fluctuation, and outputs an evaluation result to the 
weight determining means 34. 

[0043] 

The first excitation codebook 32 stores a plurality ot noise time series 
vectors generated from random noises, for example, and outputs a time series 
25 vector corresponding to an excitation code. The second excitation codebook 33 
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stores a plurality of time series vectors generated by training for reducing a 
distortion between a speech for training and its coded speech, and outputs a 
time series vector corresponding to an excitation code. The weight 
determining means 34 determines a weight provided to the time series vector 
from the first excitation codebook 32 and the time series vector from the second 
excitation codebook 33 based on the evaluation result of the noise level 
inputted from the noise level evaluating means 24, as illustrated in Fig. 5, for 
example. Each of the time series vectors from the first excitation codebook 
32 and the second excitation codebook 33 is weighted by using the weight 
provided by the weight determining means 34, and added. The time series 
vector outputted from the adaptive codebook 8 and the time series vector, 
which is generated by being weighted and added, are weighted by using 
respective gains provided by the gain encoding means 10, and added by the 
weighting-adding means 38. Then, an addition result is provided to the 
synthesis filter 7 as excitation signals, and a coded speech is produced. The 
distance calculating means 11 calculates a distance between the coded speech 
and the input speech SI, and searches an adaptive code, excitation code, and 
gain for minimizing the distance. When coding is over, the linear prediction 
parameter code, adaptive code, excitation code, and gain code for minimizing a 
distortion between the input speech and the coded speech, are outputted as a 
coding result. 
[0044] 

Explanations are made on the decoder 2. In the decoder 2, the linear 
prediction parameter decoding means 12 decodes the linear prediction 
parameter code to the linear prediction parameter. Then, the linear 
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prediction parameter decoding means 12 sets the linear prediction parameter 
as a coefficient for the synthesis filter 13, and also outputs the linear prediction 
parameter to the noise evaluating means 26. 

Explanations are made on decoding of excitation information. The 
adaptive codebook 14 outputs a time series vector corresponding to an adaptive 
code by repeating an old excitation signal periodically. The noise level 
evaluating means 26 evaluates a noise level by using the decoded linear 
prediction parameter, which is inputted from the linear prediction parameter 
decoding means 12, and the adaptive code in a same method with the noise 
level evaluating means 24 in the encoder 1, and outputs an evaluation result to 
the weight determining means 37. 

[0045] 

The first excitation codebook 35 and the second excitation codebook 36 
output time series vectors corresponding to excitation codes. The weight 
determining means 37 weights based on the noise level evaluation result 
inputted from the noise level evaluating means 26 in a same method with the 
weight determining means 34 in the encoder 1. Each of the time series 
vectors from the first excitation codebook 35 and the second excitation 
codebook 36 is weighted by using a respective weight provided by the weight 
determining means 37, and added. The time series vector outputted from the 
adaptive codebook 14 and the time series vector, which is generated by being 
weighted and added, are weighted by using respective gains decoded from the 
gain codes by the gain decoding means 16, and added oy the weighting-adding 
means 39. Then, an addition result is provided to the synthesis filter 13 as an 
excitation signal, and an output speech S3 is produced. 
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[0046] 

In embodiment 5, the noise level of the speech is evaluated by using a 
code and coding result, and the noise time series vector or non-noise time 
series vector are weighted based on the evaluation result, and added. 
5 Therefore, a high quality speech can be reproduced with a small data amount. 

[0047] 
Embodiment 6. 

In embodiments 1 - 5, it is also possible to change gain codebooks 
based on the evaluation result of the noise level. In embodiment 6, a most 
10 suitable gain codebook can be used based on the excitation codebook. 

Therefore, a high quality speech can be reproduced. 

[0048] 
Embodiment 7. 

In embodiments 1-6, the noise level of the speech is evaluated, and 
15 the excitation codebooks are switched based on the evaluation result. 

However, it is also possible to decide and evaluate each of a voiced onset, 
plosive consonant, etc., and switch the excitation codebooks based on an 
evaluation result. In embodiment 7, in addition to the noise state of the 
speech, the speech is classified in more details, e.g., voiced onset, plosive 
20 consonant, etc., and a suitable excitation codebook can be used for each state. 
Therefore, a high quality speech can be reproduced. 
[0049] 

[Effects of the Invention] 

In the speech coding method, speech decoding method, speech coding 
25 apparatus, and speech decoding apparatus according to this invention, a noise 
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level of a speech in a concerning coding period is evaluated by using a code or 
coding result of at least one of the spectrum information, power information, 
and pitch information, and various excitation codebooks are used based on the 
evaluation result. Therefore, a high quality speech can be reproduced with a 
5 small data amount. 

[0050] 

In the speech coding method and speech decoding method according to 
this invention, a plurality of excitation codebooks storing excitations with 
various noise levels is provided, and the plurality of excitation codebooks is 
10 switched based on the evaluation result of the noise level of the speech. 
Therefore, a high quality speech can be reproduced with a small data amount. 

[0051] 

In the speech coding method and speech decoding method according to 
this invention, the noise levels of the time series vectors stored in the 
15 excitation codebooks are changed based on the evaluation result of the noise 
level of the speech. Therefore, a high quality speech can be reproduced with a 
small data amount. 

[0052] 

In the speech coding method and speech decoding method according to 
20 this invention, an excitation codebook storing noise time series vectors is 
provided, and a time series vector with a low noise level is generated by 
sampling signal samples in the time series vectors based on the evaluation 
result of the noise level of the speech. Therefore, a high quality speech can be 
reproduced with a small data amount. 
25 [0053] 
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In the speech coding method and speech decoding method according to 
this invention, the first excitation codebook storing noise time series vectors 
and the second excitation codebook storing non-noise time series vectors are 
provided, and the time series vector in the first excitation codebook or the time 
5 series vector in the second excitation codebook is weighted based on the 
evaluation result of the noise level of the speech, and added to generate a time 
series vector. Therefore, a high quality speech can be reproduced with a small 
data amount. 

[Brief Description of the Drawings] 
10 [Fig- 1] Fig. 1 shows a block diagram of a whole configuration of a 

speech coding and speech decoding apparatus in embodiment 1 of this 
invention. 

[Fig. 2] Fig. 2 shows a table for explaining an evaluation of a noise level 
in embodiment 1 of this invention illustrated in Fig. 1. 
15 [Fig. 3] Fig. 3 shows a block diagram of a whole configuration of a 

speech coding and speech decoding apparatus in embodiment 3 of this 
invention. 

[Fig. 4] Fig. 4 shows a block diagram of a whole configuration of a 
speech coding and speech decoding apparatus in embodiment 5 of this 
20 invention. 

[Fig. 5] Fig. 5 shows a schematic line chart for explaining a decision 
process of weighting in embodiment 5 illustrated in Fig. 4. 

[Fig. 6] Fig. 6 shows a block diagram of a whole coniiguration oi a 
CELP speech coding and decoding apparatus according to the related art. 
25 [Fig. 7] Fig. 7 shows a block diagram of a whole configuration of an 



32 

improved CELP speech coding and decoding apparatus according to the related 
art. 

[Explanations of Signs] 

1: encoder, 2: decoder, 3: multiplexing means, 4: dividing means, 5: linear 
5 prediction parameter analyzing means, 6: linear prediction parameter coding 
means, 7, 13: synthesis filter, 8, 14: adaptive codebook, 9, 15: excitation 
codebook, 10: gain coding means, 11: distance calculating means, 12: linear 
prediction parameter decoding means, 16: gain decoding means, 17: speech 
state deciding means, 18, 21: excitation codebook switching means, 19, 22: first 
10 excitation codebook, 20, 23: second excitation codebook, 24, 26: excitation 
codebook switching means, 25, 27: excitation codebook switching means 28, 30: 
excitation codebook, 29, 31: sampling means, 32, 35: first excitation codebook 
33, 36: second excitation codebook, 34, 37: weight determining means, 38, 39: 
weighting-adding means 
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[Document Name] abstract 
[Abstract] 

[Aim] A high quality speech is reproduced with a small data amount in 
speech coding and decoding for performing compression coding and decoding of 
5 a speech signal to a digital signal. 

[Means to Solve] In speech coding method according to a code-excited linear 
prediction (CELP) speech coding, a noise level of a speech in a concerning 
coding period is evaluated by using a code or coding result of at least one of 
spectrum information, power information, and pitch information, and various 
10 excitation codebooks are used based on an evaluation result 
[Selected Figure] Fig. 1 
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Fig.3 
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Fig. 5 
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Fig.6 
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Fig. 7 
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